SemanticScuttle - klotz.me » klotz: fine tuning+lora

klotz: fine tuning* + lora*

mistral-finetune - GitHub

A light-weight codebase that enables memory-efficient and performant finetuning of Mistral's models. It is based on LoRA, a training paradigm where most weights are frozen and only 1-2% additional weights in the form of low-rank matrix perturbations are trained.

2024-06-06 Tags: github, mistral, lora, python, machine learning, fine tuning, llm by klotz
A Step-by-Step Guide to Representation Finetuning LLAMA3

"The paper introduces a technique called LoReFT (Low-rank Linear Subspace ReFT). Similar to LoRA (Low Rank Adaptation), it uses low-rank approximations to intervene on hidden representations. It shows that linear subspaces contain rich semantics that can be manipulated to steer model behaviors."

2024-05-26 Tags: linear subspace, lora, representation, fine tuning, reft, stanford, nlp, python, llm by klotz
Finetune LLMs on your own consumer hardware using tools from PyTorch and Hugging Face ecosystem | PyTorch

efficient method for fine-tuning LLM using LoRA and QLoRA, making it possible to train them even on consumer hardware

2024-01-12 Tags: llm, fine tuning, qlora, lora, peft, pytorch, hugging face, fine-tuning, llms by klotz
Easily Train a Specialized LLM: PEFT, LoRA, QLoRA, LLaMA-Adapter, and More

2023-12-10 Tags: llm, lora, qlora, peft, fine tuning by klotz

First / Previous / Next / Last / Page 1 of 0